AITopics | tea cup

tea cup

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models

Zhao, Juntu, Deng, Junyu, Ye, Yixin, Li, Chongxuan, Deng, Zhijie, Wang, Dequan

arXiv.org Artificial Intelligence, Aug-5-2024

Advances in text-to-image diffusion models have enabled a wide range of downstream applications, but such models often suffer from misalignment between text and image. Take the generation of a combination of two disentangled concepts as an example: given the prompt "a tea cup of iced coke", existing models usually generate a glass cup of iced coke, because iced coke usually co-occurs with a glass cup rather than a tea cup in the training data. The root of such misalignment is attributed to confusion in the latent semantic space of text-to-image diffusion models, and hence we refer to the "a tea cup of iced coke" phenomenon as Latent Concept Misalignment (LC-Mis). We leverage large language models (LLMs) to thoroughly investigate the scope of LC-Mis and develop an automated pipeline for aligning the latent semantics of diffusion models to text prompts. Empirical assessments confirm the effectiveness of our approach, which substantially reduces LC-Mis errors and enhances the robustness and versatility of text-to-image diffusion models. Our code and dataset are available online for reference.

concept pair, latent concept misalignment, tea cup

2408.0023

Country:

Asia > China > Shanghai > Shanghai (0.04)
North America > United States > New York (0.04)
North America > United States > California (0.04)
Asia > China > Shandong Province (0.04)

Genre: Research Report (1.00)

Industry: Consumer Products & Services > Food, Beverage, Tobacco & Cannabis > Beverages (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Faces for kettles: data collection industry flourishes as China pursues AI ambitions

#artificialintelligence, Jun-30-2019, 12:43:31 GMT

At the front of the line, a woman stands before a camera zip-tied to a tripod. In front of her face she holds a photograph of her own head, with the eyes and nose cut out, and slowly rotates from side to side. Villagers waiting their turn take a numbered ticket. Some of them say it's the third or fourth time they've come to do this sort of work. The project, run out of a sleepy village courtyard house adorned with posters of former Chinese leader Mao Zedong, is collecting material that could train artificial intelligence (AI) software to distinguish between real facial features and still images.

artificial intelligence, china, social media

Country: Asia > China (1.00)

Industry: Information Technology > Security & Privacy (0.52)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (0.37)
Information Technology > Communications > Social Media > Crowdsourcing (0.33)
